Search CORE

460 research outputs found

The genetic organisation of prokaryotic two-component system signalling pathways

Author: Robert HN Williams
David E Whitworth
RB Bourret
C Fabret
T Mizuno
JM Skerker
K Yamamoto
MT Laub
DE Whitworth
L Li
M Weigt
L Burger
L Løvdok
PJ Piggot
R Paul
S Jagadeesan
S Wegener-Feldbrügge
PJA Cock
PI Higgs
DE Whitworth
N Majdalani
S Romagnoli
LE Ulrich
M Barakat
MY Galperin
MY Galperin
DE Whitworth
MY Galperin
MY Galperin
DE Whitworth
PJ Cock
DE Whitworth
PJA Cock
A Pallejà
Y Fukuda
PJA Cock
S Schübbe
JL Appleby
P Dam
M Pertea
I Macarthur
KA Walker
W Zhang
S Romagnoli
A Busch
LE Ulrich
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Two-component systems (TCSs) are modular and diverse signalling pathways, involving a stimulus-responsive transfer of phosphoryl groups from transmitter to partner receiver domains. TCS gene and domain organisation are both potentially informative regarding biological function, interaction partnerships and molecular mechanisms. However, there is currently little understanding of the relationships between domain architecture, gene organisation and TCS pathway structure. Results Here we classify the gene and domain organisation of TCS gene loci from 1405 prokaryotic replicons (>40,000 TCS proteins). We find that 200 bp is the most appropriate distance cut-off for defining whether two TCS genes are functionally linked. More than 90% of all TCS gene loci encode just one or two transmitter and/or receiver domains, however numerous other geometries exist, often with large numbers of encoded TCS domains. Such information provides insights into the distribution of TCS domains between genes, and within genes. As expected, the organisation of TCS genes and domains is affected by phylogeny, and plasmid-encoded TCS exhibit differences in organisation from their chromosomally-encoded counterparts. Conclusions We provide here an overview of the genomic and genetic organisation of TCS domains, as a resource for further research. We also propose novel metrics that build upon TCS gene/domain organisation data and allow comparisons between genomic complements of TCSs. In particular, '<it>percentage orphaned TCS genes</it>' (or 'Dissemination') and '<it>percentage of complex loci</it>' (or 'Sophistication') appear to be useful discriminators, and to reflect mechanistic aspects of TCS organisation not captured by existing metrics.</p

Crossref

Aberystwyth Research Portal

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Online Research Database In Technology

RegExpBlasting (REB), a Regular Expression Blasting algorithm based on multiply aligned sequences

Author: AD Baxevanis
AM Arigon
E Becker
F Rubino
Francesco Rubino
M Accetturo
M Attimonelli
Marcella Attimonelli
MY Galperin
PD Hebert
Q Wang
SF Altschul
WR Pearson
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Background: One of the most frequent uses of bioinformatics tools concerns functional characterization of a newly produced nucleotide sequence (a query sequence) by applying Blast or FASTA against a set of sequences (the subject sequences). However, in some specific contexts, it is useful to compare the query sequence against a cluster such as a MultiAlignment (MA). We present here the RegExpBlasting (REB) algorithm, which compares an unclassified sequence with a dataset of patterns defined by application of Regular Expression rules to a given-as-input MA datasets. The REB algorithm workflow consists in i. the definition of a dataset of multialignments ii. the association of each MA to a pattern, defined by application of regular expression rules; iii. automatic characterization of a submitted biosequence according to the function of the sequences described by the pattern best matching the query sequence. Results: An application of this algorithm is used in the "characterize your sequence" tool available in the PPNEMA resource. PPNEMA is a resource of Ribosomal Cistron sequences from various species, grouped according to nematode genera. It allows the retrieval of plant nematode multialigned sequences or the classification of new nematode rDNA sequences by applying REB. The same algorithm also supports automatic updating of the PPNEMA database. The present paper gives examples of the use of REB within PPNEMA. Conclusion: The use of REB in PPNEMA updating, the PPNEMA "characterize your sequence" option clearly demonstrates the power of the method. Using REB can also rapidly solve any other bioinformatics problem, where the addition of a new sequence to a pre-existing cluster is required. The statistical tests carried out here show the powerful flexibility of the method

Queen's University Belfast Research Portal

Crossref

Springer - Publisher Connector

PubMed Central

Archivio istituzionale della ricerca - Università di Bari

Rapid pair-wise synteny analysis of large bacterial genomes using web-based GeneOrder4.0

Author: A Kaluszka
C Bru
Donald Seto
H Tettelin
J Tamames
MY Galperin
Padmanabhan Mahadevan
R Lavigne
R Lavigne
R Mazumder
R Overbeek
S Celamkoti
WJ Kent
Y Zheng
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Tumor taxonomy for the developmental lineage classification of neoplasms

Author: BL Humphreys
DM Baorto
E Mayr
F Marti'n-Sanchez
JJ Berman
JJ Berman
Jules J Berman
K Ahmed
LD Stein
MA Harris
MN Cantor
MY Galperin
P Zweigenbaum
PA Covitz
SH Walsh
Publication venue: BioMed Central
Publication date: 01/11/2004
Field of study

BACKGROUND: The new "Developmental lineage classification of neoplasms" was described in a prior publication. The classification is simple (the entire hierarchy is described with just 39 classifiers), comprehensive (providing a place for every tumor of man), and consistent with recent attempts to characterize tumors by cytogenetic and molecular features. A taxonomy is a list of the instances that populate a classification. The taxonomy of neoplasia attempts to list every known term for every known tumor of man. METHODS: The taxonomy provides each concept with a unique code and groups synonymous terms under the same concept. A Perl script validated successive drafts of the taxonomy ensuring that: 1) each term occurs only once in the taxonomy; 2) each term occurs in only one tumor class; 3) each concept code occurs in one and only one hierarchical position in the classification; and 4) the file containing the classification and taxonomy is a well-formed XML (eXtensible Markup Language) document. RESULTS: The taxonomy currently contains 122,632 different terms encompassing 5,376 neoplasm concepts. Each concept has, on average, 23 synonyms. The taxonomy populates "The developmental lineage classification of neoplasms," and is available as an XML file, currently 9+ Megabytes in length. A representation of the classification/taxonomy listing each term followed by its code, followed by its full ancestry, is available as a flat-file, 19+ Megabytes in length. The taxonomy is the largest nomenclature of neoplasms, with more than twice the number of neoplasm names found in other medical nomenclatures, including the 2004 version of the Unified Medical Language System, the Systematized Nomenclature of Medicine Clinical Terminology, the National Cancer Institute's Thesaurus, and the International Classification of Diseases Oncolology version. CONCLUSIONS: This manuscript describes a comprehensive taxonomy of neoplasia that collects synonymous terms under a unique code number and assigns each tumor to a single class within the tumor hierarchy. The entire classification and taxonomy are available as open access files (in XML and flat-file formats) with this article

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Automatically extracting functionally equivalent proteins from SwissProt

Author: A Amores
A Meyer
A Wagner
AA Akindahunsi
Andrew CR Martin
CH Wu
E Kretschmann
EJ Stellwag
EV Koonin
F Chen
GX Yu
II Artamonova
JM Hurst
KP O'Brien
LB Koski
Lisa EM McMillan
MC Lill
MY Galperin
RA Notebaart
RL Tatusov
RL Tatusov
S Shibata
SB Rice
SF Altschul
T Hulsen
T Hulsen
V Kunin
V van Noort
WM Fitch
Y Lee
Y Yaron
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

In summary, FOSTA provides an automated analysis of annotations in UniProtKB/Swiss-Prot to enable groups of proteins already annotated as functionally equivalent, to be extracted. Our results demonstrate that the vast majority of UniProtKB/Swiss-Prot functional annotations are of high quality, and that FOSTA can interpret annotations successfully. Where FOSTA is not successful, we are able to highlight inconsistencies in UniProtKB/Swiss-Prot annotation. Most of these would have presented equal difficulties for manual interpretation of annotations. We discuss limitations and possible future extensions to FOSTA, and recommend changes to the UniProtKB/Swiss-Prot format, which would facilitate text-mining of UniProtKB/Swiss-Prot

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

UCL Discovery

PubMed Central

Enlighten

epiPATH: an information system for the storage and management of molecular epidemiology data from infectious pathogens

Author: Alicia Amadoz
B Louie
EF Codd
Fernando González-Candelas
J Nielsen
J Nielsen
J Nielsen
M Torres-Puente
ML Barreto
MR Nelson
MY Galperin
N Jiménez Hernández
PJ Morris
SL Nuismer
USFD Administration
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Detection of putative new mutacins by bioinformatic analysis using available web tools

Author: A Dufour
D Ajdić
DH Haft
DH Haft
DI Andersson
GG Nicolas
Guillaume G Nicolas
HB Shen
IF Nes
JD Hale
L Smith
LA Martin-Visscher
M Begley
M Ibrahim
M Kleerebezem
MA Riley
MY Galperin
PA Wescombe
S Lata
SF Altschul
SW Lee
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

In order to characterise new bacteriocins produced by Streptococcus mutans we perform a complete bioinformatic analyses by scanning the genome sequence of strains UA159 and NN2025. By searching in the adjacent genomic context of the two-component signal transduction system we predicted the existence of many putative new bacteriocins' maturation pathways and some of them were only exclusive to a group of Streptococcus. Computational genomic and proteomic analysis combined to predictive functionnal analysis represent an alternative way for rapid identification of new putative bacteriocins as well as new potential antimicrobial drugs compared to the more traditional methods of drugs discovery using antagonism tests

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Novel cyclic di-GMP effectors of the YajQ protein family control bacterial virulence

Author: A Teplyakov
C Momany
C Saveanu
CD Boyd
CR Guzzo
D-G Ha
David Mackey
Delphine L. Caly
F Tao
GG Anderson
H Mulcahy
H Slater
H Sondermann
IS Pultz
J Duevel
J Mansfield
J Nesper
J. Maxwell Dow
Joseph Ward
K-H Chin
KB Twomey
Melanie Febrer
MY Galperin
R Hengge
Robert P. Ryan
RP Ryan
RP Ryan
RP Ryan
RP Ryan
RP Ryan
RP Ryan
S Moreau-Marquis
S-Q An
S-Q An
S-Q An
Sarah L. Murdoch
SE Maddocks
Shi-qi An
T Lundback
T Schirmer
U Romling
X Qiao
X-H Lu
Y Fouhy
Y McCarthy
Y McCarthy
Yvonne McCarthy
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

Bis-(3 ',5 ') cyclic di-guanylate (cyclic di-GMP) is a key bacterial second messenger that is implicated in the regulation of many critical processes that include motility, biofilm formation and virulence. Cyclic di-GMP influences diverse functions through interaction with a range of effectors. Our knowledge of these effectors and their different regulatory actions is far from complete, however. Here we have used an affinity pull-down assay using cyclic di-GMP-coupled magnetic beads to identify cyclic di-GMP binding proteins in the plant pathogen Xanthomonas campestris pv. campestris (Xcc). This analysis identified XC_3703, a protein of the YajQ family, as a potential cyclic di-GMP receptor. Isothermal titration calorimetry showed that the purified XC_3703 protein bound cyclic di-GMP with a high affinity (K-d similar to 2 mu M). Mutation of XC_3703 led to reduced virulence of Xcc to plants and alteration in biofilm formation. Yeast two-hybrid and far-western analyses showed that XC_3703 was able to interact with XC_2801, a transcription factor of the LysR family. Mutation of XC_2801 and XC_3703 had partially overlapping effects on the transcriptome of Xcc, and both affected virulence. Electromobility shift assays showed that XC_3703 positively affected the binding of XC_2801 to the promoters of target virulence genes, an effect that was reversed by cyclic di-GMP. Genetic and functional analysis of YajQ family members from the human pathogens Pseudomonas aeruginosa and Stenotrophomonas maltophilia showed that they also specifically bound cyclic di-GMP and contributed to virulence in model systems. The findings thus identify a new class of cyclic di-GMP effector that regulates bacterial virulence

Public Library of Science (PLOS)

Southampton (e-Prints Soton)

Crossref

Directory of Open Access Journals

Irish Universities

PubMed Central

Cork Open Research Archive

University of Dundee Online Publications

FigShare

Getting Started in Structural Phylogenomics

Author: B Qian
CE Jones
CM Zmasek
D Baker
D Brown
DJ Zwickl
ED Scheeff
F Delsuc
I Friedberg
JA Eisen
K Sjölander
Kimmen Sjölander
ML Green
MY Galperin
N Goldman
N Krishnamurthy
N Krishnamurthy
O Goldenberg
Olga Troyanskaya
RC Edgar
S Sankararaman
SE Brenner
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Crossref

Directory of Open Access Journals

PubMed Central

Species-level functional profiling of metagenomes and metatranscriptomes.

Author: A Sczyrba
A Shafquat
AE Duran-Pinedo
AK Sharma
B Buchfink
B Langmead
BE Suzek
BK Swan
C Burke
C Luo
Curtis Huttenhower
D Medini
DH Huson
DT Truong
DT Truong
E Pasolli
EA Franzosa
EA Franzosa
Eric A. Franzosa
George Weingart
GG Silva
Gholamali Rahnavard
H Hauswedell
J Kim
J Lloyd-Price
J Lloyd-Price
J Ravel
J. Gregory Caporaso
JA Fuhrman
K Huang
Karen Schwarzberg Lipson
Lauren J. McIver
LR Thompson
LR Thompson
Luke R. Thompson
M Hamady
M Kanehisa
M Scholz
Melanie Schirmer
MY Galperin
N Segata
N Segata
Nicola Segata
OU Mason
P Petrenko
PJ Turnbaugh
R Caspi
RC Edgar
RD Finn
Rob Knight
S Abubucker
S Nayfach
S Sunagawa
S Sunagawa
T Bose
UniProt Consortium.
W Huang
Y Ye
Y Zhao
Publication venue: eScholarship, University of California
Publication date: 01/11/2018
Field of study

Functional profiles of microbial communities are typically generated using comprehensive metagenomic or metatranscriptomic sequence read searches, which are time-consuming, prone to spurious mapping, and often limited to community-level quantification. We developed HUMAnN2, a tiered search strategy that enables fast, accurate, and species-resolved functional profiling of host-associated and environmental communities. HUMAnN2 identifies a community's known species, aligns reads to their pangenomes, performs translated search on unclassified reads, and finally quantifies gene families and pathways. Relative to pure translated search, HUMAnN2 is faster and produces more accurate gene family profiles. We applied HUMAnN2 to study clinal variation in marine metabolism, ecological contribution patterns among human microbiome pathways, variation in species' genomic versus transcriptional contributions, and strain profiling. Further, we introduce 'contributional diversity' to explain patterns of ecological assembly across different microbial community types

Crossref

eScholarship - University of California